CAROTENOID KETOLASE GENES AND GENE PRODUCTS,
PRODUCTION OF KETOCAROTENOIDS AND METHODS OF
MODIFYING CAROTENOIDS USING THE GENES

BACKGROUND OF THE INVENTION
Carotenoids are widely distributed natural pigments that are responsible for
many of the yellow, orange and red colors seen in living organisms. They have
important commercial uses as coloring agents in the food industry, as feed and food
additives, in cosmetics and as provitamin A precursors.

The plant species Adonis aestivalis produces flowers with petals that are deep
red in color and nearly black at the base of the petals due to the accumulation of
ketocarotenoid and other carotenoid pigments (Neamtu et al., Rev. Roum. Biochim.
6:157, 1969). This pattern of carotenoid accumulation accounts for the common name
of some varieties of this species: summer pheasant's eye.

Among the carotenoids identified in the petals of the red petal varieties of these
various species is the ketocarotenoid astaxanthin (3,3'-dihydroxy-4,4'-diketo-β,β-carotene;
see Figure 1). Various other ketocarotenoids (see Figure 1), including
3-hydroxyechinenone (3-hydroxy-4-keto-β,β-carotene), adonirubin
(3-hydroxy-4,4'-diketo-β,β-carotene) and adonixanthin (3,3'-dihydroxy-4-keto-β,β-carotene),
as well as isozeaxanthin (4,4'-dihydroxy-β,β-carotene; see T.W. Goodwin, The
Biochemistry of the Carotenoids, vol. 1, Plants, 2nd edition, 1980, page 147), have
also been reported. The occurrence of the latter compound is consistent with
speculation that a 4-hydroxy group may be an intermediate in the formation of the
4-keto group.

SUMMARY OF THE INVENTION
There is appreciable interest in the biological production of carotenoids, in
particular the orange-colored ketocarotenoids such as astaxanthin and canthaxanthin
(Figure 1), and in the modification of carotenoid composition. For this reason, an A.
aestivalis flower cDNA library was constructed and screened for cDNAs encoding
enzymes (hereinafter referred to as "ketolases" although the specific biochemical
activity has not yet been established) involved in the conversion of β-carotene into
orange compounds with absorption properties similar to those exhibited by common
ketocarotenoids such as canthaxanthin (Figure 1). Two distinctly different Adonis
aestivalis cDNAs were obtained from among a number of cDNAs that were selected on
this basis.

Thus, a first aspect of the present invention is a purified nucleic acid sequence
which encodes for a protein having ketolase enzyme activity and has the nucleic acid
sequence of SEQ ID NO: 1 or 3.

The invention also includes a purified nucleic acid sequence which encodes for
a protein having ketolase enzyme activity and having the amino acid sequence of SEQ
ID NO: 2 or 4.

The invention also includes vectors which comprise any portion of the nucleic 
acid sequences listed above, and host cells transformed with such vectors. 

Another aspect of the present invention is a method of producing a
ketocarotenoid in a host cell, the method comprising

inserting into the host cell a vector comprising a heterologous nucleic acid
sequence which encodes for a protein having ketolase enzyme activity and comprises
(1) SEQ ID NO: 1 or 3 or (2) a sequence which encodes the amino acid sequence of
SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably linked
to a promoter; and

expressing the heterologous nucleic acid sequence, thereby producing
the ketolase enzyme.

Another subject of the present invention is a method of modifying the production
of carotenoids in a host cell, relative to an untransformed host cell, the method
comprising

inserting into a host cell which already produces carotenoids a vector
comprising a heterologous nucleic acid sequence which encodes for a protein having
ketolase enzyme activity and comprises (1) SEQ ID NO: 1 or 3 or (2) a sequence which
encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the heterologous
nucleic acid sequence is operably linked to a promoter; and

expressing the heterologous nucleic acid sequence in the host cell to
modify the production of the carotenoids in the host cell, relative to an untransformed
host cell.

BRIEF DESCRIPTION OF THE DRAWINGS
A more complete appreciation of the invention and many of the attendant
advantages thereof will be readily obtained as the same becomes better understood by
reference to the following detailed description when considered in connection with the
accompanying drawings.

Figure 1 illustrates structures and biochemical routes leading from β-carotene to various
of the ketocarotenoids referred to in the text. Conversion of β-carotene to astaxanthin
by a hydroxylase enzyme (Hy) and a ketolase enzyme (keto) could proceed via any one
or all of several possible routes depending on the order of the reactions.

Figure 2 illustrates the beta ring structure of β-carotene and various modifications of this
parent ring that might be produced through the action of the products of the A. aestivalis
ketolase cDNAs. Also shown is the structure of the epsilon ring, not found to be a
substrate for the A. aestivalis ketolases and present in carotenoids such as δ-carotene,
ε-carotene, α-carotene and lutein.

Figure 3 illustrates results obtained with TLC (thin layer chromatography) separation of
carotenoid pigments extracted from E. coli cultures, previously engineered to produce
β-carotene, but that now also contain the A. aestivalis ketolase cDNAs and/or other
introduced genes and cDNAs. The Figure indicates the empty plasmid vector
pBluescript SK- (SK-), the Adonis aestivalis ketolase 1 cDNA in this plasmid vector (Ad
keto1), the Haematococcus pluvialis ketolase cDNA in this plasmid vector (Hp keto), or
the Arabidopsis β-carotene hydroxylase cDNA (At Ohase). Bands that were orange in
color are shown here with a darker fill than those with a yellow color. Identities of
various bands are indicated to the right of the band.

Figure 4 illustrates the absorption spectrum of one of the orange carotenoids produced
from β-carotene via the action of the Adonis ketolases and makes clear the similarity
of the spectrum to that of canthaxanthin. Shown are absorption spectra (in acetone) of
β-carotene, canthaxanthin and an unknown orange product (orange band #1; the lower
orange band in the first lane of Figure 3) extracted from cultures after introduction of
the Adonis aestivalis keto1 cDNA (SEQ ID NO: 1) into cells of E. coli that otherwise
produce and accumulate β-carotene. The absorption spectrum of the unknown resembles
that of canthaxanthin but the compound migrates to a position below echinenone on RP18
TLC plates developed with a mobile phase of methanol:acetone (1:1 by volume). The
absorption spectrum of orange band #2 also is similar to that of canthaxanthin but it
migrates more rapidly than canthaxanthin, indicating that it is probably a more polar
compound.


Figure 5 shows SEQ ID NO: 5 (the sequence shown in this Figure includes SEQ ID NO:
1 and also includes some of the flanking DNA from the adaptor DNA and the multiple
cloning site (MCS) of the library cloning vector, which sequences are shown in bold).

Figure 6 shows SEQ ID NO: 6 (the sequence shown in this Figure includes SEQ ID NO:
2 and also includes a translation of amino acids resulting from the adaptor DNA and
the multiple cloning site (MCS) of the library cloning vector and the start codon from the
plasmid vector pTrcHis, which sequences are shown in bold and capitalized).

Figure 7 shows SEQ ID NO: 7 (the sequence shown in this Figure includes SEQ ID NO:
3 and also includes some of the flanking DNA from the adaptor DNA and the multiple
cloning site (MCS) of the library cloning vector, which sequences are shown in bold).

Figure 8 shows SEQ ID NO: 8 (the sequence shown in this Figure includes SEQ ID NO:
4 and also includes a translation of amino acids resulting from the adaptor DNA and
the multiple cloning site (MCS) of the library cloning vector and the start codon from the
plasmid vector, which sequences are shown in bold and capitalized).

Figure 9 shows a "Gap" alignment of the two Adonis ketolase sequences of the
invention. A truncated version of SEQ ID NO: 1 is shown in this Figure for comparative
purposes, and is designated SEQ ID NO: 9. The percentage identity was calculated
to be 91.107%.


Figure 10 shows a "Gap" alignment of SEQ ID NO: 2 and 4. The following results were 
found: 

Gap weight:          12        average match:       2.912
Length weight:        4        average mismatch:   -2.003
Quality:           1440        length:                307
Ratio:            4.691        gaps:                    0
percent similarity: 92.182     percent identity:   90.228

Figure 11 shows a comparison between SEQ ID NO: 2 and the Arabidopsis thaliana
β-carotene hydroxylase enzyme (GenBank U58919) (SEQ ID NO: 10).

Figure 12A shows gDNA (SEQ ID NO: 11) immediately upstream of the cDNA of SEQ
ID NO: 3. The sequence was obtained from a PCR product generated using the
GenomeWalker kit of Clontech Laboratories, Inc. (1020 East Meadow Circle, Palo Alto,
CA 94303-4230) and nested primers specific to the ketolases of Adonis aestivalis
(cagaatcggtctgttctattagttcttcc (SEQ ID NO: 17) and caatttgaggaatatcaaggttccttgttctc
(SEQ ID NO: 18)). The termination codon upstream of and in-frame with the initiation
codon (TAA at positions 204-206) is shown in bold. The initiation codon (ATG) is also
shown in bold.

Figure 12B (SEQ ID NO: 12) indicates that the full length polypeptide of SEQ ID NO:
4 begins with the amino acids MAA (shown in bold) immediately preceding the ketolase
sequence shown in Figure 8. A similar MAA amino acid sequence immediately
preceding SEQ ID NO: 1 is also expected.

Figure 13 shows an alignment of SEQ ID NO: 2, SEQ ID NO: 12, an Arabidopsis
β-carotene hydroxylase enzyme (predicted product of GenBank U58919) (SEQ ID NO:
13), a putative second Arabidopsis hydroxylase predicted by genomic DNA sequence
(GenBank AB025606; the exon/intron junctions were chosen with reference to the
product of the Arabidopsis β-carotene hydroxylase cDNA U58919) (SEQ ID NO: 14),
and two Capsicum annuum β-carotene hydroxylases (predicted products of GenBank
Y09722 and Y09225) (SEQ ID NO: 15 and 16).

DESCRIPTION OF THE PREFERRED EMBODIMENTS
The present invention is directed to a purified nucleic acid sequence which
encodes for a protein having ketolase enzyme activity and has the nucleic acid
sequence of SEQ ID NO: 1 or 3.

The invention also includes a purified nucleic acid sequence which encodes for
a protein having ketolase enzyme activity and having the amino acid sequence of SEQ
ID NO: 2 or 4.

Two different but closely related nucleic acids have been isolated. The
sequences of the longest example of each are presented herein. Sequencing of
upstream genomic DNA, conducted subsequently, indicates that SEQ ID
NO: 3 lacks bases encoding the first three amino acids (MAA; see Figure 12). This is
likely also the case for SEQ ID NO: 1, but the upstream genomic sequences have not
yet been obtained for this nucleic acid.
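The reasoning behind the upstream sequencing can be illustrated with a short sketch. Figure 12A reports a TAA at positions 204-206 that lies upstream of, and in frame with, the initiation codon; a stop codon is in frame when its distance from the ATG is a multiple of three. The function name and the toy sequences below are hypothetical and are not taken from SEQ ID NO: 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def upstream_in_frame_stops(dna, atg_index):
    """Return 0-based start indices of stop codons that lie upstream of,
    and in the same reading frame as, the ATG starting at atg_index."""
    dna = dna.upper()
    if dna[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    hits = []
    # Step backwards from the ATG one codon (three bases) at a time,
    # which keeps every inspected triplet in the ATG's reading frame.
    for start in range(atg_index - 3, -1, -3):
        if dna[start:start + 3] in STOP_CODONS:
            hits.append(start)
    return hits


# Hypothetical toy sequences, for illustration only.
in_frame = "CCTAAGGGCCCATGGCTGCT"    # TAA sits nine bases before the ATG
out_of_frame = "TAAGGGGCCCATGGCTGCT"  # TAA sits ten bases before the ATG

print(upstream_in_frame_stops(in_frame, in_frame.index("ATG")))
print(upstream_in_frame_stops(out_of_frame, out_of_frame.index("ATG")))
```

Positions in the text are 1-based, so a codon reported at positions 204-206 would start at index 203 in this 0-based sketch.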

The two different Adonis ketolases denoted in SEQ ID NO: 1 and 3 are similar
in sequence, sharing about 91% identity, as determined by the Gap program discussed
below (see Figure 9). The predicted amino acid sequences of the enzymes denoted in
SEQ ID NO: 2 and 4 share about 92% similarity and about 90% identity, also as
determined by the Gap program (see Figure 10).
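The Gap statistics reported for Figure 10 appear to be internally consistent, which a little arithmetic can show. The sketch below assumes that Ratio is Quality divided by the alignment length, and that the percentages are whole-number position counts divided by the 307 aligned residues. The counts of 283 similar and 277 identical positions are inferred from the reported percentages; they are not stated in the text.

```python
# Values reported for the Gap alignment of SEQ ID NO: 2 and 4 (Figure 10).
quality = 1440
length = 307

# Assumption: Ratio = Quality / length.
ratio = quality / length

# Assumption: counts inferred from the reported percentages, not given
# explicitly in the source.
similar_positions = 283
identical_positions = 277

percent_similarity = 100 * similar_positions / length
percent_identity = 100 * identical_positions / length

print(f"ratio:              {ratio:.3f}")
print(f"percent similarity: {percent_similarity:.3f}")
print(f"percent identity:   {percent_identity:.3f}")
```

Under these assumptions the three computed values round to the figures quoted for Figure 10 (4.691, 92.182 and 90.228).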

Therefore, it is clear that certain modifications of SEQ ID NO: 1 or 3 or SEQ ID
NO: 2 or 4 can take place without destroying the activity of the enzyme. Note also that
certain truncated versions of the cDNAs of SEQ ID NO: 1 or 3 were found to be
functional (i.e., these cDNAs retained the property of causing the conversion of
β-carotene to orange compounds). Also, the Arabidopsis β-carotene hydroxylase
(GenBank U58919), aligned with the ketolase SEQ ID NO: 2 in Figure 11, retains
catalytic function when truncated to yield a polypeptide that lacks the first 129 amino
acids (Sun et al., 1996). From the alignment in Figure 11, therefore, this would suggest
that the two ketolases of the invention retain catalytic activity after truncation to remove
bases encoding the first 132 amino acids.

Thus, the present invention is intended to include those ketolase nucleic acid
and amino acid sequences in which substitutions, deletions, additions or other
modifications have taken place, as compared to SEQ ID NO: 1 or 3 or SEQ ID NO: 2
or 4, without destroying the activity of the ketolase enzyme. Preferably, the
substitutions, deletions, additions or other modifications take place at those positions
which already show dissimilarity between the present sequences. For SEQ ID NO: 1,
as shown in Figure 9, these positions are as follows: positions 7, 20, 23, 35, 53, 63, 65,
67, 76, 78, 85, 86, 91, 107, 109-111, 135, 140, 144, 146, 160, 168, 217, 219, 241, 249,
254, 256, 271, 291, 296, 349, 389, 400, 406, 431, 448, 449, 460, 471, 499, 530, 589,
619, 643, 653, 654, 667, 679, 709, 731, 742, 784, 787, 836, 871, 883, 896, 911, 919,
928, 930, 939, 943, 967, 969, 978, 979, 982, 988, 995, 1005, 1006, 1012-1014, 1017,
1019-1021, 1023, 1025, 1049, 1050, 1054, 1060-1068, 1070-1073, 1075, 1094, 1100,
1101, 1106, 1107, 1109 and 1111-1176. For SEQ ID NO: 3, as shown in Figure 9,
these positions are as follows: positions 7, 20, 23, 35, 53, 63, 65, 67, 76, 78, 85, 86, 91,
107, 109-111, 135, 140, 144, 146, 160, 168, 217, 219, 241, 249, 254, 256, 271, 291,
296, 349, 389, 400, 406, 431, 448, 449, 460, 471, 499, 530, 589, 619, 643, 653, 654,
667, 679, 709, 731, 742, 784, 787, 836, 871, 883, 896, 911, 919, 928, 930, 939, 943,
966, 967, 970, 979, 980, 983, 989, 996, 1006, 1007, 1013-1015, 1018, 1020-1022,
1024, 1026, 1050, 1051, 1055, 1062-1065, 1067, 1086, 1092, 1093, 1098, 1099, 1101
and 1103-1112.

For SEQ ID NO: 2 and 4, as shown in Figure 10, the following amino acids can
be substituted or deleted, or additions or other modifications can be made, without
destroying the activity of the ketolase enzyme: positions 7, 8, 12, 18, 21, 22, 25, 26, 36,
37, 45, 47-49, 56, 73, 83, 85, 97, 99, 130, 144, 150, 157, 166, 218, 244, 279, 299 and
304. Therefore, the present invention also intends to cover amino acid sequences
where such changes have been made.
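Position lists of this kind mix single positions with ranges such as 47-49, so it can help to expand them before comparing against an alignment. The following is a minimal sketch; the helper name `expand_positions` is hypothetical, and the input string is the amino acid position list quoted above.

```python
def expand_positions(spec):
    """Expand a list such as '7, 8, 47-49 and 304' into individual integers."""
    positions = []
    # Treat the final "and" as just another separator.
    for token in spec.replace(" and ", ", ").split(","):
        token = token.strip()
        if not token:
            continue
        if "-" in token:
            first, last = (int(part) for part in token.split("-"))
            positions.extend(range(first, last + 1))  # ranges are inclusive
        else:
            positions.append(int(token))
    return positions


AMINO_ACID_POSITIONS = (
    "7, 8, 12, 18, 21, 22, 25, 26, 36, 37, 45, 47-49, 56, 73, 83, 85, "
    "97, 99, 130, 144, 150, 157, 166, 218, 244, 279, 299 and 304"
)

variable = expand_positions(AMINO_ACID_POSITIONS)
print(len(variable), "variable positions")
print(f"{100 * (307 - len(variable)) / 307:.3f} percent of 307 residues unchanged")
```

The list expands to 30 positions. Against the 307-residue alignment of Figure 10 that leaves 277 unchanged residues, which matches the reported identity of 90.228 percent.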

In each case, nucleic acid and amino acid sequence similarity and identity are
measured using sequence analysis software, for example, the Sequence Analysis, Gap,
or BestFit software packages of the Genetics Computer Group (University of Wisconsin
Biotechnology Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAlign
(DNAStar, Inc., 1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford
Molecular Group, 2105 S. Bascom Avenue, Suite 200, Campbell, California 95008).
Such software uses algorithms to match similar sequences by assigning degrees of
identity to various substitutions, deletions, and other modifications, and includes
detailed instructions as to useful parameters, etc., such that those of routine skill in the
art can easily compare sequence similarities and identities. An example of a useful
algorithm in this regard is the algorithm of Needleman and Wunsch, which is used in the
Gap program discussed above. This program finds the alignment of two complete
sequences that maximizes the number of matches and minimizes the number of gaps.
Another useful algorithm is the algorithm of Smith and Waterman, which is used in the
BestFit program discussed above. This program creates an optimal alignment of the
best segment of similarity between two sequences. Optimal alignments are found by
inserting gaps to maximize the number of matches using the local homology algorithm
of Smith and Waterman.
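The global alignment idea behind the Needleman and Wunsch algorithm can be sketched in a few lines. This is an illustration only and not the Gap program itself: it assumes a simple scoring scheme (match +1, mismatch -1, a fixed penalty of -2 per gap position), whereas Gap uses a substitution matrix together with separate gap and length weights. The function name and the test sequences are hypothetical.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align a and b; return (score, aligned_a, aligned_b)."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against nothing costs one gap per residue.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    def pair(i, j):
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill the table: each cell is the best of diagonal, up and left moves.
    for i in range(1, rows):
        for j in range(1, cols):
            score[i][j] = max(
                score[i - 1][j - 1] + pair(i, j),
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )

    # Trace back from the bottom-right corner to recover one best alignment.
    i, j = len(a), len(b)
    out_a, out_b = [], []
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return score[-1][-1], "".join(reversed(out_a)), "".join(reversed(out_b))


total, top, bottom = needleman_wunsch("ACGT", "AGT")
print(total)
print(top)
print(bottom)
```

A local (Smith and Waterman) variant differs mainly in clamping each cell at zero and starting the traceback from the highest-scoring cell rather than the corner.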

Conservative (i.e., similar) substitutions typically include substitutions within the
following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid,
glutamic acid, asparagine and glutamine; serine and threonine; lysine and arginine; and
phenylalanine and tyrosine. Substitutions may also be made on the basis of conserved
hydrophobicity or hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132
(1982)), or on the basis of the ability to assume similar polypeptide secondary structure
(see Chou and Fasman, Adv. Enzymol. 47: 45-148 (1978)).
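The substitution groups listed above translate directly into a lookup. The sketch below encodes only those six groups, using standard one-letter amino acid codes; the function name `classify_substitution` is hypothetical, and the hydrophobicity and secondary-structure criteria mentioned in the text are not modeled.

```python
# The six groups named in the text, in one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    set("GA"),    # glycine, alanine
    set("VIL"),   # valine, isoleucine, leucine
    set("DENQ"),  # aspartic acid, glutamic acid, asparagine, glutamine
    set("ST"),    # serine, threonine
    set("KR"),    # lysine, arginine
    set("FY"),    # phenylalanine, tyrosine
]


def classify_substitution(residue_a, residue_b):
    """Label a pair of residues as identical, conservative or non-conservative."""
    a, b = residue_a.upper(), residue_b.upper()
    if a == b:
        return "identical"
    if any(a in group and b in group for group in CONSERVATIVE_GROUPS):
        return "conservative"
    return "non-conservative"


for pair in [("K", "R"), ("D", "Q"), ("G", "W"), ("A", "A")]:
    print(pair, classify_substitution(*pair))
```

Counting identical plus conservative pairs over an alignment gives a similarity figure in the spirit of the one reported for Figure 10, although the Gap program derives similarity from its own scoring matrix rather than from these groups.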

If comparison is made between nucleotide sequences, preferably the length of
comparison sequences is at least 50 nucleotides, more preferably at least 60
nucleotides, at least 75 nucleotides or at least 100 nucleotides. It is most preferred if
comparison is made between the nucleic acid sequences encoding the enzyme coding
regions necessary for enzyme activity. If comparison is made between amino acid
sequences, preferably the length of comparison is at least 20 amino acids, more
preferably at least 30 amino acids, at least 40 amino acids or at least 50 amino acids.
It is most preferred if comparison is made between the amino acid sequences in the
enzyme coding regions necessary for enzyme activity.

While the two different Adonis ketolase enzymes of the present invention are
similar in sequence, previously described bacterial (Misawa et al., 1995), cyanobacterial
(Fernandez-Gonzalez et al., 1997), and green algal (Haematococcus pluvialis; Lotan et
al., 1995; Kajiwara et al., 1995) β-carotene ketolase enzymes bear little resemblance
to the Adonis ketolases, although certain histidine motifs and features of the predicted
secondary structure are common to the polypeptides predicted by both groups
(Cunningham and Gantt, 1998).

The present invention also includes vectors containing the nucleic acids of the
invention. Suitable vectors according to the present invention comprise a gene
encoding a ketolase enzyme as described above, wherein the gene is operably linked
to a suitable promoter. Suitable promoters for the vector can be constructed using
techniques well known in the art (see, for example, Sambrook et al., Molecular Cloning:
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989;
Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing and Wiley
Interscience, New York, 1991). Suitable vectors for eukaryotic expression in plants are
described in Fray et al. (1995; Plant J. 8:693-701) and Misawa et al. (1994; Plant J.
6:481-489). Suitable vectors for prokaryotic expression include pACYC184, pUC119,
and pBR322 (available from New England BioLabs, Beverly, MA) and pTrcHis
(Invitrogen) and pET28 (Novagen) and derivatives thereof. The vectors of the present
invention can additionally contain regulatory elements such as promoters, repressors,
selectable markers such as antibiotic resistance genes, etc., the construction of which
is very well known in the art.

The genes encoding the ketolase enzymes as described above, when cloned
into a suitable expression vector, can be used to overexpress these enzymes in a host
cell expression system or to inhibit the expression of these enzymes. For example, a
vector containing a gene of the invention may be used to increase the amount of
ketocarotenoids in an organism and thereby alter the nutritional or commercial value or
pharmacology of the organism. A vector containing a gene of the invention may also
be used to modify the carotenoid production in an organism.

Therefore, the present invention includes a method of producing a
ketocarotenoid in a host cell, the method comprising

inserting into the host cell a vector comprising a heterologous nucleic acid
sequence which encodes for a protein having ketolase enzyme activity and comprises
(1) SEQ ID NO: 1 or 3 or (2) a sequence which encodes the amino acid sequence of
SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably linked
to a promoter; and

expressing the heterologous nucleic acid sequence, thereby producing
the ketocarotenoid.

The present invention also includes a method of modifying the production of
carotenoids in a host cell, relative to an untransformed host cell, the method comprising

inserting into a host cell which already produces carotenoids a vector
comprising a heterologous nucleic acid sequence which encodes for a protein having
ketolase enzyme activity and comprises (1) SEQ ID NO: 1 or 3 or (2) a sequence which
encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the heterologous
nucleic acid sequence is operably linked to a promoter; and

expressing the heterologous nucleic acid sequence in the host cell to
modify the production of the carotenoids in the host cell, relative to an untransformed
host cell.

The term "modifying the production" means that the amount of carotenoids
produced can be enhanced, reduced, or left the same, as compared to an
untransformed host cell. In accordance with one embodiment of the present invention,
the make-up of the carotenoids (i.e., the type of carotenoids produced) is changed
vis-a-vis each other, and this change in make-up may result in either a net gain, net loss,
or no net change in the amount of carotenoids produced in the cell. In accordance with
another embodiment of the present invention, the production or the biochemical activity
of the carotenoids (or the enzymes which catalyze their formation) is enhanced by the
insertion of the ketolase enzyme-encoding nucleic acid. In yet another embodiment of
the invention, the production or the biochemical activity of the carotenoids (or the
enzymes which catalyze their formation) may be reduced or inhibited by a number of
different approaches available to those skilled in the art, including but not limited to
such methodologies or approaches as anti-sense (e.g., Gray et al. (1992) Plant Mol.
Biol. 19:69-87), ribozymes (e.g., Wegener et al. (1994) Mol. Gen. Genet.
245(4):465-470), co-suppression (e.g., Fray et al. (1993) Plant Mol. Biol.
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. (1997) Plant J.
11:1195-1206), intracellular antibodies (e.g., see Rondon et al. (1997) Annu. Rev.
Microbiol. 51:257-283) or whatever other approaches rely on the knowledge or
availability of the nucleic acid sequences of the invention, or the enzymes encoded
thereby.

Host systems according to the present invention preferably comprise any
organism which is capable of producing carotenoids, or which already produces
carotenoids. Such organisms include plants, algae, certain bacteria, cyanobacteria and
other photosynthetic bacteria. Transformation of these hosts with vectors according to
the present invention can be done using standard techniques. See, for example,
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY, 1989; Ausubel et al., Current Protocols in
Molecular Biology, Greene Publishing and Wiley Interscience, New York, 1991.

Alternatively, transgenic organisms can be constructed which include the nucleic
acid sequences of the present invention. The incorporation of these sequences can
allow the control of carotenoid biosynthesis, content, or composition in the host cell.
These transgenic systems can be constructed to incorporate sequences which allow
for the overexpression of the various nucleic acid sequences of the present invention.
Transgenic systems can also be constructed which allow for the underexpression of the
various nucleic acid sequences of the present invention. Such systems may contain
anti-sense expression of the nucleic acid sequences of the present invention. Such
anti-sense expression would result in the accumulation of the substrates of the enzyme
encoded by the sense strand.
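An anti-sense construct expresses the strand complementary to the sense sequence, which in practice means working with the reverse complement of the coding DNA. A minimal sketch follows; the nine-base fragment is a hypothetical example chosen to encode Met-Ala-Ala, and it is not taken from the sequences of the invention.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string, preserving case."""
    return dna.translate(COMPLEMENT)[::-1]


# Hypothetical sense-strand fragment (ATG GCT GCT), for illustration only.
sense = "ATGGCTGCT"
antisense = reverse_complement(sense)

print(sense)
print(antisense)
```

Applying the function twice returns the original sequence, which is a convenient sanity check when preparing constructs in either orientation.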

Having generally described this invention, a further understanding can be
obtained by reference to certain specific examples which are provided herein for
purposes of illustration only and are not intended to be limiting unless otherwise
specified.

EXAMPLE 1

Isolation of plant cDNAs that convert β-carotene into compounds with
ketocarotenoid-like spectra

A flower cDNA library from the plant Adonis aestivalis was introduced into a
strain of Escherichia coli engineered to accumulate the yellow carotenoid pigment
β-carotene (see Cunningham et al., Plant Cell 8:1613-26, 1996). This strain of E. coli
normally forms yellow colonies when cultures are spread on a solid agar growth
medium. Ketocarotenoids that are derived from β-carotene, such as echinenone and
canthaxanthin (Figure 1), are, in contrast, orange to orange-red in color. Colonies that
were orange rather than yellow in color were visually selected, and the DNA sequences
of the Adonis aestivalis cDNAs within the plasmid vectors contained in these colonies
were ascertained. Two distinct cDNAs were obtained from analysis of cDNA inserts in
plasmids obtained from approximately 10 selected colonies. The DNA sequences of
these two ketolase cDNAs are presented herein.

WO 99/61652    PCT/US99/10455

The products produced by the ketolases of the invention which have been
expressed in a β-carotene-accumulating strain of Escherichia coli have not yet been
identified. As many as 5 or 6 different colored bands, in addition to the substrate
β-carotene, may readily be discerned by C18 TLC separation (see Figure 3). To provide
appropriate standards to assist in identification, an H. pluvialis ketolase and an
Arabidopsis β-carotene hydroxylase were separately introduced into the
β-carotene-accumulating E. coli to produce echinenone (4-keto-β,β-carotene) and
canthaxanthin (4,4'-diketo-β,β-carotene) or β-cryptoxanthin (3-hydroxy-β,β-carotene)
and zeaxanthin (3,3'-dihydroxy-β,β-carotene). None of the compounds formed in the
presence of the ketolases of the invention (no difference was observed in products
formed in the presence of the two different nucleic acid sequences of the invention)
shows both the TLC migration and the absorption spectrum expected for echinenone,
canthaxanthin, β-cryptoxanthin, or zeaxanthin. Two of the colored TLC bands produced
in the presence of the Adonis ketolase cDNAs are orange in color. Orange band #1
has an absorption spectrum similar to that of canthaxanthin (see Figure 4) but migrates
in a position that indicates a polarity intermediate between those of echinenone and
β-carotene. Orange band #2 also has an absorption spectrum like that of canthaxanthin
but migrates in a position that indicates a polarity intermediate between those of
canthaxanthin and zeaxanthin (see Figure 3). The absorption spectra and TLC results
suggest that the two orange products could be desaturated at the 3-4 positions of both
rings (3,4-didehydro; see Figure 2). Orange band #1 (see Figure 3) might then be
3,4,3',4'-tetradehydro-β,β-carotene. To substantially affect the absorption spectrum of
the substrate β-carotene, any modifications very likely involve a carbon that lies in
conjugation with the conjugated chain of carbon-carbon double bonds that constitute
the chromophore (Goodwin, 1980; The Biochemistry of the Carotenoids, volume I; 2nd
edition, Chapman and Hall). For the spectra obtained, only the carbons at the number
4 position of the two rings appear to be plausible locations for modification. The
multitude and TLC migrations of the yellow and orange products produced from the
symmetrical β-carotene, however, also indicate that the enzymes of the invention carry
out more than a single type of reaction. The apparent homology of the ketolases of the
invention to the Arabidopsis β-carotene hydroxylase would suggest that compounds
with a hydroxyl at the 3 and/or 4 positions of one or both rings are another possible
outcome (see Figure 2). In fact, such compounds have been identified in Adonis (see
above), and it has long been conjectured that a hydroxyl at position 4 is an intermediate
in the formation of the 4-keto (e.g., crustaxanthin, a 3,3',4,4'-tetrahydroxy carotenoid that
might be a precursor for astaxanthin in the exoskeleton of the lobster). The histidine
motifs and secondary structure in common to the hydroxylase and ketolase enzymes
are characteristics of a large group of di-iron oxygenases whose members also include
examples of desaturases (J. Shanklin, 1998, Ann. Rev. Plant Physiol. Plant Mol. Biol.);
therefore, a 3-4 desaturation (and/or perhaps a 2-3 desaturation in one or more of the
yellow compounds) would also seem a plausible outcome.

To summarize the results of this example for the Adonis ketolases of the
invention, a number of different carotenoids, including two with ketocarotenoid-like
spectra, are produced from β-carotene via the action of the products of either of the two
different nucleic acids of the invention. These orange compounds appear to be the
major products. Truncation and fusion of the cDNAs to a stronger promoter in the
vector pTrcHis (Invitrogen) was detrimental to growth of E. coli but did result in
improved yield of the most polar orange product (orange band #2 in Figure 3).
Introduction of a cyanobacterial ferredoxin did not change the yield or relative amounts
of the various products. Without being bound by theory, it may be that the
ketocarotenoids produced in flower petals of Adonis actually include the as yet
unidentified orange compounds that are produced in E. coli using the nucleic acids of
the invention.

EXAMPLE 2

Substrate specificity of the Adonis ketolases

Carotenoids with ε rings are common in plants. The ε ring differs from the β ring
only in the position of the double bond within the ring (Figure 2). The ε ring is reported
to be a poor substrate for the Arabidopsis β-carotene hydroxylase (Sun et al., 1996).
The Adonis ketolase cDNAs were introduced into strains of E. coli engineered
(Cunningham et al., 1996) to accumulate carotenoids with one or two ε rings
(δ-carotene and ε-carotene), or the acyclic carotenoid lycopene. TLC analysis of acetone
extracts revealed that these carotenoids were not modified by the Adonis ketolases,
as indicated by a lack of any new products formed. Products produced in E. coli
engineered to accumulate zeaxanthin (Sun et al., 1996) appeared to be the same as
for β-carotene-accumulating cultures, indicating that a 3-OH is likely to be one of the
functional groups introduced to the β ring by the Adonis ketolases. The more polar
orange band produced from β-carotene through the action of the Adonis ketolases (e.g.,
orange band #2 in Figure 3), therefore, could very well be
3,3'-dihydroxy-3,4,3',4'-tetradehydro-β,β-carotene.

The references cited in the application, along with the following references, are 
incorporated by reference: 

Bouvier F, et al. (1998) Xanthophyll biosynthesis: molecular and functional
characterization of carotenoid hydroxylases from pepper fruits (Capsicum annuum L.).
Biochim Biophys Acta. 1391:320-8

Breitenbach J, et al. (1996) Expression in Escherichia coli and properties of the
carotene ketolase from Haematococcus pluvialis. FEMS Microbiol Lett. 140:241-6

Cunningham FX Jr, Gantt E (1998) Genes and enzymes of carotenoid biosynthesis in
plants. Ann Rev Plant Physiol Plant Mol Biol. 49:557-583

Fernandez-Gonzalez B, et al. (1997) A new type of asymmetrically acting beta-carotene
ketolase is required for the synthesis of echinenone in the cyanobacterium
Synechocystis sp. PCC 6803. J Biol Chem. 272:9728-33

Fraser PD, et al. (1997) In vitro characterization of astaxanthin biosynthetic enzymes.
J Biol Chem. 272:6128-35

Fraser PD, et al. (1998) Enzymic confirmation of reactions involved in routes to
astaxanthin formation, elucidated using a direct substrate in vitro assay. Eur J Biochem.
252:229-36

Harker M, et al. (1997) Biosynthesis of ketocarotenoids in transgenic cyanobacteria
expressing the algal gene for beta-C-4-oxygenase, crtO. FEBS Lett. 404:129-34

Kajiwara S, et al. (1995) Isolation and functional identification of a novel cDNA for
astaxanthin biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis in
Escherichia coli. Plant Mol Biol. 29:343-52

Lotan T, et al. (1995) Cloning and expression in Escherichia coli of the gene encoding
beta-C-4-oxygenase, that converts beta-carotene to the ketocarotenoid canthaxanthin
in Haematococcus pluvialis. FEBS Lett. 364:125-8

Misawa N, et al. (1995) Canthaxanthin biosynthesis by the conversion of methylene to
keto groups in a hydrocarbon beta-carotene by a single gene. Biochem Biophys Res
Commun. 209:867-76

Misawa N, et al. (1995) Structure and functional analysis of a marine bacterial
carotenoid biosynthesis gene cluster and astaxanthin biosynthetic pathway proposed
at the gene level. J Bacteriol. 177:6575-84

Miura Y, et al. (1998) Production of the carotenoids lycopene, beta-carotene, and
astaxanthin in the food yeast Candida utilis. Appl Environ Microbiol. 64:1226-9

Shanklin J, et al. (1997) Mossbauer studies of alkane omega-hydroxylase: evidence for 
a diiron cluster in an integral-membrane enzyme. Proc Natl Acad Sci USA. 94:2981-6 

Shanklin J, Gaboon EB (1998) Desaturation and related modifications of fatty acids. 
Ann Rev Plant Physiol Plant Mol Biol 49: 61 1-641 

Wang CW, et al. Engineered isoprenoid pathway enhances astaxanthin production in 
Escherichia coli. Biotechnol Bioeng. 1999 Jan 20;62(2):235-41. 
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I claim: 

1. A method of producing a ketocarotenoid in a host cell, the method comprising

inserting into the host cell a vector comprising a heterologous nucleic acid
sequence which encodes for a protein having ketolase enzyme activity and has the
nucleic acid sequence of SEQ ID NO: 1 or 3, wherein the heterologous nucleic acid
sequence is operably linked to a promoter; and

expressing the heterologous nucleic acid sequence, thereby producing
the ketocarotenoid.

2. The method of claim 1, wherein the host cell is selected from the group
consisting of a bacterial cell, an algal cell and a plant cell.

3. A method of producing a ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid
sequence which encodes for a protein having ketolase enzyme activity and has a
sequence which encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the
heterologous nucleic acid sequence is operably linked to a promoter; and

expressing the heterologous nucleic acid sequence, thereby producing
the ketocarotenoid.

4. The method of claim 3, wherein the host cell is selected from the group
consisting of a bacterial cell, an algal cell and a plant cell.

5. A method of modifying the production of carotenoids in a host cell, relative to an
untransformed host cell, the method comprising

inserting into a host cell which already produces carotenoids a vector
comprising a heterologous nucleic acid sequence which encodes for a protein having
ketolase enzyme activity and has the nucleic acid sequence of SEQ ID NO: 1 or 3,
wherein the heterologous nucleic acid sequence is operably linked to a promoter; and

expressing the heterologous nucleic acid sequence in the host cell to
modify the production of the carotenoids in the host cell, relative to an untransformed
host cell.

6. The method of claim 5, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

7. A method of modifying the production of carotenoids in a host cell, relative to an
untransformed host cell, the method comprising

inserting into a host cell which already produces carotenoids a vector
comprising a heterologous nucleic acid sequence which encodes for a protein having
ketolase enzyme activity and has a sequence which encodes the amino acid sequence
of SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably
linked to a promoter; and

expressing the heterologous nucleic acid sequence in the host cell to
modify the production of the carotenoids in the host cell, relative to an untransformed
host cell.

8. The method of claim 7, wherein the host cell is selected from the group
consisting of a bacterial cell, an algal cell and a plant cell.

9. A purified nucleic acid sequence which encodes for a protein having ketolase
enzyme activity and has the nucleic acid sequence of SEQ ID NO: 1.

10. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has the nucleic acid sequence of SEQ ID NO: 3. 

11. A purified nucleic acid sequence which encodes for a protein having ketolase
enzyme activity and has a sequence which encodes the amino acid sequence of SEQ
ID NO: 2.

12. A purified nucleic acid sequence which encodes for a protein having ketolase
enzyme activity and has a sequence which encodes the amino acid sequence of SEQ
ID NO: 4.

13. A vector which comprises the nucleic acid sequence of any one of claims 9-12,
wherein the nucleic acid sequence is operably linked to a promoter.

14. A host cell which is transformed with the vector of claim 13. 

15. The host cell of claim 14, wherein the host cell is selected from the group
consisting of a bacterial cell, an algal cell and a plant cell.

16. The host cell of claim 14, wherein the host cell is a photosynthetic cell. 

17. The host cell of claim 14, wherein the host cell contains a ketocarotenoid. 

18. The host cell of claim 14, wherein the host cell contains modified levels of 
carotenoids, relative to an untransformed host cell. 

19. A purified ketolase enzyme which is encoded by the amino acid sequence of
SEQ ID NO: 2.

20. A purified ketolase enzyme which is encoded by the amino acid sequence of
SEQ ID NO: 4.

Figure 5 [SEQ ID NO: 5]

-23 ggg ctgcaggaat tcggcacgag
1 agcaatctca gtgttcagta caagttattc tttccacaag aatctcttgt
51 tgcactcaaa acaagacatt ctcaaccgcc catgtttgct cttctctcca
101 gttgtggtgg agtcgcctat gagaaagaaa aagacacatc gtgctgcatg
151 tatctgctct gttgcagaga gaacaaggaa ccttgatatt cctcaaattg
201 aagaagagga agagaacgag gaagaactaa tagaacagac ggattctggc
251 ataattcata taaagaaaac gctagggggg aaacaatcaa gacggtccac
301 tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgatcg
351 gacctgctgt ttacttcaag ttttcacggc taatggagtg tggagatatt
401 cctgtcgcag aaatggggat tacgtttgcc gcctttgttg ctgctgcgat
451 tggcacggaa tttttgtcag gatgggttca caaagaactc tggcacgatt
501 ctttgtggta cattcacaag tctcaccata ggtcacgaaa aggccgcttc
551 gagttcaatg atgtgtttgc tattattaac gcgcttcctg ctattgctct
601 tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg
651 gtaccggtct tggaacgaca gtctgtggca tggcttacat ttttcttcac
701 aatggccttt cacaccgaag gttcccagta gggcttattg caaacgtccc
751 ttatttccac aagctggctg cagctcacca aatccatcac tcaggaaaat
801 ttcagggtgt accatttggc ctgttccttg gaccccagga attggaagaa
851 gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa
901 acgaacgcaa tcatctacat gaatcaactc ttttacattt atgaggtttt
951 agtttatcgg tgttacaagt cacacatttg tgtcgttgta gtaattcaaa
1001 gttaccatac tcttttttag aatttttttt tgatgtatag gtcgcggagt
1051 tacggttaca aaggccaaat ctattgttgt ggaattccat tattaaaaat
1101 aaaaattaga gtttgtagtt ttatctggtg atcaatatca atatatatta
1151 attaaagcaa aaaaaaaaaa aaaaaa ctcgag

Figure 6 [SEQ ID NO: 6]

MGLQEFGTR

aisvfstsys fhknlllhsk qdilnrpcll fspvvvespm rkkkthraac
icsvaertrn ldipqieeee eneeelieqt dsgiihikkt lggkqsrrst
gsivapvscl gilsmigpav yfkfsrlmec gdipvaemgi tfaafvaaai
gteflsgwvh kelwhdslwy ihkshhrsrk grfefndvfa iinalpaial
inygfsnegl lpgacfgtgl gttvcgmayi flhnglshrr fpvglianvp
yfhklaaahq ihhsgkfqgv pfglflgpqe leevrggtee lervisrtak
rtqsst* 
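
The relationship between the nucleotide sequence of Figure 5 and the amino acid sequence of Figure 6 can be illustrated with a short translation sketch. The Python below is illustrative only and is not part of the application: it assumes the standard genetic code, and the reading-frame offset of one nucleotide is an inference from comparing positions 1 to 60 of SEQ ID NO: 1 with the start of SEQ ID NO: 2. The names `translate`, `CODON_TABLE` and `SEQ1_FIRST_60` are hypothetical.

```python
# Standard genetic code, built from the conventional t/c/a/g codon ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate a DNA string from `offset` (0-based) until a stop codon
    or the end of the sequence. Unrecognised codons become 'X'."""
    dna = "".join(dna.lower().split())  # drop the spaces used in the listing
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")
        if residue == "*":  # stop codon ends the open reading frame
            break
        protein.append(residue)
    return "".join(protein)


# Positions 1 to 60 of SEQ ID NO: 1, as given in the sequence listing.
SEQ1_FIRST_60 = (
    "agcaatctca gtgttcagta caagttattc tttccacaag aatctcttgt tgcactcaaa"
)

# Assumed frame: the first codon of SEQ ID NO: 2 starts at nucleotide 2.
print(translate(SEQ1_FIRST_60, offset=1))
```

Under these assumptions the printed residues can be compared by eye with the first line of Figure 6 after the vector-derived MGLQEFGTR prefix.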
Figure 7 [SEQ ID NO: 7]


-23 ggg ctgcaggaat tcggcacgag
1 agcaatttca gtgttcagtt caggttattc tttctacaag aatctcttgt
51 tggactcaaa accaaatatt ctcaaacccc catgcctgct attctctcca
101 gttgtgatca tgtcgcctat gagaaagaaa aagaaacatg gtgatccatg
151 tatctgctcc gttgcaggga gaacaaggaa ccttgatatt cctcaaattg
201 aagaagagga agagaatgtg gaagaactaa tagaacagac cgattctgac
251 atagtgcata taaagaaaac actagggggg aaacaatcaa aacggcccac
301 tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgattg
351 gacctgctgt ttacttcaag ttttcacggc taatggaggg tggagatata
401 cctgtagcag aaatggggat tacgtttgcc acctttgttg ctgctgctgt
451 tggcacggag tttttgtcag catgggttca caaagaactc tggcacgagt
501 ctttgtggta cattcacaag tctcaccatc ggtcacgaaa aggccgcttc
551 gagttcaatg atgtgtttgc tattattaac gcgcttcccg ctattgctct
601 tatcaattat ggattctcca atgaaggcct ccttcctgga gcgtgctttg
651 gtgtcggtct tggaacaaca gtctgtggta tggcttacat ttttcttcac
701 aatggcctat cacaccgaag gttcccagta tggcttattg cgaacgtccc
751 ttatttccac aagctggctg cagctcacca aatacaccac tcaggaaaat
801 ttcagggtgt accatttggc ctgttccttg gacccaagga attggaagaa
851 gtaagaggag gcactgaaga gttggagagg gtaatcagtc gtacaactaa
901 acgaacgcaa ccatctacct gaatcaattt ttttacatat ataaggtttt
951 agtttatcgg tgttataaaa tcacacatcc gtatcgtttt agtaagtcaa
1001 agttaagata cttccttctt agaatatttt ttgatgtata ggtcgcggat
1051 atactgttac actattcgtt gtggaattcc attataaaaa aataaaaaaa
1101 aaaaaaaaaa aa ctcgag

Figure 8 [SEQ ID NO: 8]

MGLQEFGTR

aisvfssgys fyknllldsk pnilkppcll fspvvimspm rkkkkhgdpc
icsvagrtrn ldipqieeee enveelieqt dsdivhikkt lggkqskrpt
gsivapvscl gilsmigpav yfkfsrlmeg gdipvaemgi tfatfvaaav
gteflsawvh kelwheslwy ihkshhrsrk grfefndvfa iinalpaial
inygfsnegl lpgacfgvgl gttvcgmayi flhnglshrr fpvwlianvp
yfhklaaahq ihhsgkfqgv pfglflgpke leevrggtee lervisrttk
rtqpst*

Figure 9: Gap of SEQ ID NO: 9 and SEQ ID NO: 3
1 agcaatctcagtgttcagtacaagttattctttccacaagaatctcttgt 50

mill iiMiiiiiiii II iiMiiriiii iiiiiiiiiiiiiii 

1 agcaatttcagtgttcagttcaggttattctttctacaagaatctcttgt 50 
51 tgcactcaaaacaagacattctcaaccgcccatgtttgctcttctctcca 100 

II MM Mill I I iiiiiiin Mill I nil I I mill I 

51 tggactcaaaaccaaatattctcaaacccccatgcctgctattctctcca 100 

^ , « • 

101 gttgcggrggagrcgcctatgagaaagaaaaagacacatcgtgctgcatg 150 

lllliri" IIIIIIIIIIMMIIMIMII Mil 111 I nil 

101 gttgtgat.c2r.gtcgcctatgagaaagaaaaagaaacatggtgatccatg 150 
151 tat.ctgct:=rgt:tgcagagagaacaaggaacctitgatat.tcct.caaattg 200 

linillM llillll IIMIIIilllllllMIIMIIIIIIIIIII 

151 tatctgctccgttgcagggagaacaaggaaccttgatattcctcaaattg 200 

. • • 

201 aagaagaggaagagaacgaggaagaactaatagaacagacggattctggc 250 

I MM II I! II III 1 1 I IIMIIIIIIII MM II IN IHMM I 

201 aagaagagcaagagaatgtggaagaactaatagaacagaccgattctgac 250 
251 ataattcarataaagaaaacgctaggggggaaacaatcaaigacggtccac 300 

Ml I IMIIIIMIIIII IIMMIIIIIMIIMII I 111 MM 

251 atagtgcazacaaagaaaacactaggggggaaacaatcaaaacggcccac 300 

301 tggctccaztgtGgcacccgtatcttgtcttgggatcctttcaatgatcg 350 

I I I I I M 11 II I I II I I 1 1 i 1 1 1 1 1 M 1 1 1 M II M II I I I I I 1 I I 1 I I 
301 tggctccattgtcgcacccgtatcttgtcttgggatcctttcaatgattg 350

^ • • • 

351 gacctgctgtctacttcaagttttcacggctaatggagtgtggagatatt 400 

I I I I I I M I I I I I I I I I I I I t I I I I M I III I I I I I I I I II 11 M II I 
351 gacctgctgtctacttcaagttttcacggctaatggagggtggagatata 400 

, • • 

401 cctgtcgcagaaatggggattacgtttgccgcctttgttgctgctgcgat 450 

mil llllllliliMMMIIMIIII IIIIIIIIMMIIM I 

401 cctgtagcagaaatggggattacgtttgccacctttgttgctgctgctgt 450 
451 cggcacggaarctttgccaggatgggtrcacaaagaaccctggcacgact 500 

IIIIIIMI lililMIM IIIIIIIIIIIIIIIIIIMMIIIM I 

451 tggcacggagtttttgtcagcatgggttcacaaagaactctggcacgagt 500 

501 ccttgtggtacattcacaagtctcaccataggtcacgaaaaggccgcttc 550 

I I I i I I ! I II I 11 III M I I I I I I I I II I 11 II M I I III II i 11 II II 
501 ctttgcggtacattcacaagtctcaccatcggtcacgaaaaggccgcttc 550 

551 gagttcaatgatgtgttcgctattattaacgcgcttcctgctat'tgctct 600 

IllllMIIIMlllllllMIIIIIIMIIIIIIIII IIIMMIMI 

551 cagttcaatgatgtgtttgctattattaacgcgcttcccgctattgctct 600 

. ♦ • 

601 tatcaattatggattctcaaatgaaggcccccttcccggagcctgctttg 650 

lillllllllllllllll illllllllllllllllllllll lllllll 
601 tatcaattatggattctccaatgaaggcctccttcctggagcgtgctttg 650 

651 gtaccggtcttggaacgacagtctgtggcatggcttacatttttcttcac 700 

II IIIIIMIIIII llllillllll llllllllllillllMIIll 

651 gtgtcggtcttggaacaacagtctgtggtatggcttacatttttcttcac 700

7 01 aatggcctttcacaccgaaggttcccagtagggcttattgcaaacgtccc 750 

iiiiiiii iiiiiiiiiiiiiiiiiiiii'iiiiiiiiii iiiiiiii 

701 aacggcctatcacaccgaaggttcccagtatggcttattgcgaacgtccc 750 

• • • • 

751 ttatttccacaagctggctgcagcucaccaaatccatcactcaggaaaat 800 

I 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 

751 ttatttccacaagctggcrgcagctcaccaaatacaccactcaggaaaat 800 

801 tincagggrgcaccacttggcctgt-ccttggaccccaggaatcggaagaa 350 

lllllllllllllllllllllllliilllllllll MMIIIIIIIIII 

801 ttcagggtgtaccatttggcccgttccttggacccaaggaattggaagaa 850 
851 gtaagaggaggcactgaagaattggagagggtgatcagtcgtacagctaa 900 

iiiiiiiiiiiiiiiiiiii III III III M II II nil II II nil 

851 gtaagaggaggcactgaagagttggagagggtaatcagtcgtacaactaa 900 
901 acgaacgcaatcatctacaTGAatcaacccttttacatttatgaggtttt 950 

II ill II II I I II II II IIIIIIII I inn II I Hi 11 nil I 

901 acgaacgcaaccatctaccTGAatcaatttctttacatatataaggtttt 950 
951 agtttatcggtgtta.caagtcacacatttgtgtcgttgta'gtaattcaa 999 

iiiiiiiiiiiiiii II iiiiiiii II iiiii niin nil 

951 agtttatcggtgttataaaatcacacatccgcatcgttttagtaagtcaa 1000 

1000 agttaccatactcttttttagaattttttcttgatgtataggtcgcggag 1049 

inn mil ii i i i nnnninninnniii 

1001 agttaagatacttccttcttagaatattttttgatgtataggtcgcggat 1050

• • • * * 

1050 ttacggctacaaacgccaaatctattgttgtiggaattccattiattaaaaa 1099 

1 1 1 Mill I i 1 1 1 i M I M I I I 1 1 1 1 i I Mill 

1051 atactgtitac actattcgttgtggaattccactataaaaaa 1091 

1100 taaaaattagagtccgtagtttuacctggtgaccaatatcaatatatatt: 1149 

nil I I 

1092 ataaaaaaaaaaaaaaaaaaa

Figure 10: Gap of SEQ ID NO: 2 and SEQ ID NO: 4

• • • • 

1 AISVFSTSYSFHKNLI^SKQDIIJSrRPCLLFSPVVVESPMRKKKTHRAAC 50 

Mllil. ilhilllMI -11 Mliillih illilll I I 

1 AISVFSSGYSFYKNLLLDSKPNILKPPCIiLFSPWIMSPMRKKKKHGDPC 50 
51 iCSVAERTRIJIiDIPQIEEEEENEEELIEQTDSGIIHIKKTLGGKQSRRST 100 

lllll Mliililllllllll iMIIIIII hllilllllllhl I 

51 ICSVAGRTRNU3IPQIEESEENVEELIEQTDSDIVHIKKTLGGKQSKRPT 100 
101 GSIVAPVSZLGILSMIGPAVYFKFSPJ^KECGDIPVAEMGITFAAFVAAAI 150 

I i I M M : : ! 1 1 1 1 1 M 1 1 1 1 1 1 1 1 i I ! ! ! I M 1 1 1 1 1 1 1 I I I 1 1 i h 

101 GSIVAPVSCLGILSMIGPAVYFKFSRLM2GGDIPVAEMGITFATFVAAAV 150 

» • • • " 

151 GTEFLSGW\--IKELWHDSLWYIHKSHHRSRKGRFEFNDVFAIINALPAIAL 200 

1 1 1 1 1 I ! : I I 1 i I I : I I I I I I I I I ! I i ! I I I I I I I 1 1 M I i I I I M I i I 
151 GTEFLSAW\--iKELWKESLWYIHKSHHRSRKGRFEFNDVFAIINAL,PAIAL 200 

. • • • 

201 INYGFSNZGLLPGACFGTGLGTTVCGMAYIFLHNGLSHRRFPVGLIANVP 250 

lllllliiilllllMI IIIMIIIIIilMllMlilMli nil II 

201 INYGFSNEGIiLPGACFGVGLGTTVCGMAYIFLHNGLSHRRFPVWLIANVP 250 
251 YFKKIAAAHQIHKSGKFQGVPFGLFLGPQELEEVRGGTEELERVISRTAK 300 

IIIIIIMIIIIIIIIIIillMMIIMIlMlillllllilllM I 

251 YFHKLAAAHQIHKSGKFQGVPFGLFLGPKELSEVRGGTEELERVISRTTK 300 

301 RTQSST* 307 • 

III III 
301 RTQPST* 307 
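
The pairwise comparison in Figure 10 can be summarised numerically. The following Python sketch is an illustration rather than the program used to generate the figure: it counts identical positions in two already-aligned sequences and draws a bar only at identities, so it does not reproduce the similarity symbols of the original Gap output. The function names and the constants `SEQ2_1_50` and `SEQ4_1_50` (residues 1 to 50 of SEQ ID NO: 2 and SEQ ID NO: 4, taken from the sequence listing) are hypothetical labels.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned columns holding the same residue.
    Gap characters ('-' or '.') never count as a match."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    if not a:
        raise ValueError("empty alignment")
    matches = sum(1 for x, y in zip(a, b) if x == y and x not in "-.")
    return 100.0 * matches / len(a)


def match_line(a: str, b: str) -> str:
    """'|' under identical residues, a space elsewhere."""
    return "".join(
        "|" if x == y and x not in "-." else " " for x, y in zip(a, b)
    )


# Residues 1 to 50 of the two Adonis ketolases (spaces removed on load).
SEQ2_1_50 = "".join(
    "AISVFSTSYS FHKNLLLHSK QDILNRPCLL FSPVVVESPM RKKKTHRAAC".split()
)
SEQ4_1_50 = "".join(
    "AISVFSSGYS FYKNLLLDSK PNILKPPCLL FSPVVIMSPM RKKKKHGDPC".split()
)

print(SEQ2_1_50)
print(match_line(SEQ2_1_50, SEQ4_1_50))
print(SEQ4_1_50)
print(f"{percent_identity(SEQ2_1_50, SEQ4_1_50):.1f}% identical over 50 residues")
```

Because the two sequences have the same length and Figure 10 shows them aligned without gaps, the same functions could be applied to the full-length sequences.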
Figure 11: Gap of SEQ ID NO: 2 and Arabidopsis b-carotene hydroxylase (SEQ
ID NO: 10)

• • • • • 

1 AISVrSTSYSFHKNLLLHSKQDILNRPCLLFSPVVVESPMRKKKTH^ 50 

. II. .1 II • .11. : I . 

1 MAAXLS7AVTFKP. . .LHRSFSSSSTDFRLRLPKSLSGFSPSLRFKRFSV 47 

, • • • • 

51 ICSVAERTR^^LDIPQIEEEEENEEELIEQTDSGIIHIKKTLGGKQSRRST 100 

I! I . I I II : : : • I M III 

4 8 CYV\^ERRQNSPIENDERPESTSSTNAIDAEyiJU:iRIAEKIiERK^ 97 

101 GSIVAPVSCLGILSMIGPAVYFKFSRLMECGDIPVAEMGITFAAFVAAAI 150 

n .1 II II llh:|l Ilhi • M Ml III: 
98 yLIAAMLSSFGITSMAVMAVYYRFSWQMEGGEISML.EMFGTFALSVGAAV 147 

• • • • • 
151 GTEFLSGWVHKELWHDSLWYIHKSHKRSRKGRFEFNDVFAIINALPAIAL 2 00 

I II . i |: ill III -Mih l-i II llillhli III I 
148 GMEFWARWAHRMjWHASLWNIffliSHHKPRSGPFEIjNDVFAIVNAGPAIGL 197 

• • • • 

201 INYGFSNEGLiPGACFGTGLGTTVCGMAYIFLHNGLSHRRFPVGLIANVP 250 

:.ill MMI Ml III 11 MMMMI hlllll IMI 

198 LSYGFFNKGLVPGLCFGAGLGITVFGIAYMFVHDGLiVHKRFPVGPIADVP 247 

251 YFHKLAAAHQIHHSGKFQGVPFGLFLGPQELEEVRGGTEELERVISRTAK 300 

I Millhil- II llhlllMMMII II III:: III 1 
248 YLRKVAAAHQLHHTDKFNGVPYGLFIiGPKELEEV.GGNEELDKEISRRIK 296 

301 RTQSST* 307 

297 SYKKASGSGSSSSS* 311

Figure 12A (SEQ ID NO: 11)

1 CATACCATAA ATAGTAGAGG ACAACCTACA AACCAACCAC CAGAAACCTC 50 
51 CAATGGCAGC 


Figure 12B (SEQ ID NO: 12) 

MAAAISVFSSGYSFYKNLLLDSKPNILKPPCLLFSPVVIMSPMRKKKKHGDPCICSVAGR
TRNLDIPQIEEEEENVEELIEQTDSDIVHIKKTLGGKQSKRPTGSIVAPVSCLGILSMIG
PAVYFKFSRLMEGGDIPVAEMGITFATFVAAAVGTEFLSAWVHKELWHESLWYIHKSHHR
SRKGRFEFNDVFAIINALPAIALINYGFSNEGLLPGACFGVGLGTTVCGMAYIFLHNGLS
HRRFPVWLIANVPYFHKLAAAHQIHHSGKFQGVPFGLFLGPKELEEVRGGTEELERVISR
TTKRTQPST*

Figure 13

[Multiple alignment of the amino acid sequences labeled At1, At2, Ca1, Ca2, AdK1 and AdK6, shown with a consensus line. The residue counts given at the end of the alignment are 310, 305, 315, 316, 306 and 309, respectively.]

SEQUENCE LISTING

<110> CUNNINGHAM, Francis X. 


<120> CAROTENOID KETOLASE GENES AND GENE PRODUCTS, PRODUCTION 
OF KETOCAROTENOIDS AND METHODS OF MODIFYING CAROTENOIDS 
USING THE GENES 

<130> 8172-9022 


<140> Unknown 
<141> 1999-05-21 

<150> 60/086,460 
<151> 1998-05-22 

<160> 18 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 1176 
<212> DNA 

<213> Adonis aestivalis 


<400> 1 

agcaatctca gtgttcagta caagttattc tttccacaag aatctcttgt tgcactcaaa 60 

acaagacatt ctcaaccgcc catgtttgct cttctctcca gttgtggtgg agtcgcctat 120 

gagaaagaaa aagacacatc gtgctgcatg tatctgctct gttgcagaga gaacaaggaa 180 

ccttgatatt cctcaaattg aagaagagga agagaacgag gaagaactaa tagaacagac 240

ggattctggc ataattcata taaagaaaac gctagggggg aaacaatcaa gacggtccac 300

tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgatcg gacctgctgt 360

ttacttcaag ttttcacggc taatggagtg tggagatatt cctgtcgcag aaatggggat 420

tacgtttgcc gcctttgttg ctgctgcgat tggcacggaa tttttgtcag gatgggttca 480

caaagaactc tggcacgatt ctttgtggta cattcacaag tctcaccata ggtcacgaaa 540

aggccgcttc gagttcaatg atgtgtttgc tattattaac gcgcttcctg ctattgctct 600

tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg gtaccggtct 660 

tggaacgaca gtctgtggca tggcttacat ttttcttcac aatggccttt cacaccgaag 720 

gttcccagta gggcttattg caaacgtccc ttatttccac aagctggctg cagctcacca 780 

aatccatcac tcaggaaaat ttcagggtgt accatttggc ctgttccttg gaccccagga 840 

attggaagaa gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa 900 

acgaacgcaa tcatctacat gaatcaactc ttttacattt atgaggtttt agtttatcgg 960 

tgttacaagt cacacatttg tgtcgttgta gtaattcaaa gttaccatac tcttttttag 1020 

aatttttttt tgatgtatag gtcgcggagt tacggttaca aaggccaaat ctattgttgt 1080 

ggaattccat tattaaaaat aaaaattaga gtttgtagtt ttatctggtg atcaatatca 1140 

atatatatta attaaagcaa aaaaaaaaaa aaaaaa 1176


<210> 2
<211> 306
<212> PRT 

<213> Adonis aestivalis 
<400> 2 

Ala Ile Ser Val Phe Ser Thr Ser Tyr Ser Phe His Lys Asn Leu Leu
1 5 10 15

Leu His Ser Lys Gln Asp Ile Leu Asn Arg Pro Cys Leu Leu Phe Ser
20 25 30

Pro Val Val Val Glu Ser Pro Met Arg Lys Lys Lys Thr His Arg Ala
35 40 45

Ala Cys Ile Cys Ser Val Ala Glu Arg Thr Arg Asn Leu Asp Ile Pro
50 55 60

Gln Ile Glu Glu Glu Glu Glu Asn Glu Glu Glu Leu Ile Glu Gln Thr
65 70 75 80

Asp Ser Gly Ile Ile His Ile Lys Lys Thr Leu Gly Gly Lys Gln Ser
85 90 95

Arg Arg Ser Thr Gly Ser Ile Val Ala Pro Val Ser Cys Leu Gly Ile
100 105 110

Leu Ser Met Ile Gly Pro Ala Val Tyr Phe Lys Phe Ser Arg Leu Met
115 120 125

Glu Cys Gly Asp Ile Pro Val Ala Glu Met Gly Ile Thr Phe Ala Ala
130 135 140

Phe Val Ala Ala Ala Ile Gly Thr Glu Phe Leu Ser Gly Trp Val His
145 150 155 160

Lys Glu Leu Trp His Asp Ser Leu Trp Tyr Ile His Lys Ser His His
165 170 175

Arg Ser Arg Lys Gly Arg Phe Glu Phe Asn Asp Val Phe Ala Ile Ile
180 185 190

Asn Ala Leu Pro Ala Ile Ala Leu Ile Asn Tyr Gly Phe Ser Asn Glu
195 200 205

Gly Leu Leu Pro Gly Ala Cys Phe Gly Thr Gly Leu Gly Thr Thr Val
210 215 220

Cys Gly Met Ala Tyr Ile Phe Leu His Asn Gly Leu Ser His Arg Arg
225 230 235 240

Phe Pro Val Gly Leu Ile Ala Asn Val Pro Tyr Phe His Lys Leu Ala
245 250 255

Ala Ala His Gln Ile His His Ser Gly Lys Phe Gln Gly Val Pro Phe
260 265 270

Gly Leu Phe Leu Gly Pro Gln Glu Leu Glu Glu Val Arg Gly Gly Thr
275 280 285

Glu Glu Leu Glu Arg Val Ile Ser Arg Thr Ala Lys Arg Thr Gln Ser
290 295 300

Ser Thr
305

<210> 3 
<211> 1112 
<212> DNA 

<213> Adonis aestivalis 
<400> 3 

agcaatttca gtgttcagtt caggttattc tttctacaag aatctcttgt tggactcaaa 60

accaaatatt ctcaaacccc catgcctgct attctctcca gttgtgatca tgtcgcctat 120

gagaaagaaa aagaaacatg gtgatccatg tatctgctcc gttgcaggga gaacaaggaa 180

ccttgatatt cctcaaattg aagaagagga agagaatgtg gaagaactaa tagaacagac 240

cgattctgac atagtgcata taaagaaaac actagggggg aaacaatcaa aacggcccac 300

tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgattg gacctgctgt 360

ttacttcaag ttttcacggc taatggaggg tggagatata cctgtagcag aaatggggat 420 

tacgtttgcc acctttgttg ctgctgctgt tggcacggag tttttgtcag catgggttca 480 

caaagaactc tggcacgagt ctttgtggta cattcacaag tctcaccatc ggtcacgaaa 540 

aggccgcttc gagttcaatg atgtgtttgc tattattaac gcgcttcccg ctattgctct 600 

tatcaattat ggattctcca atgaaggcct ccttcctgga gcgtgctttg gtgtcggtct 660 

tggaacaaca gtctgtggta tggcttacat ttttcttcac aatggcctat cacaccgaag 720 

gttcccagta tggcttattg cgaacgtccc ttatttccac aagctggctg cagctcacca 780 

aatacaccac tcaggaaaat ttcagggtgt accatttggc ctgttccttg gacccaagga 840 

attggaagaa gtaagaggag gcactgaaga gttggagagg gtaatcagtc gtacaactaa 900 

acgaacgcaa ccatctacct gaatcaattt ttttacatat ataaggtttt agtttatcgg 960 

tgttataaaa tcacacatcc gtatcgtttt agtaagtcaa agttaagata cttccttctt 1020 

agaatatttt ttgatgtata ggtcgcggat atactgttac actattcgtt gtggaattcc 1080 

attataaaaa aataaaaaaa aaaaaaaaaa aa 1112

<210> 4
<211> 306
<212> PRT

<213> Adonis aestivalis
<400> 4

Ala Ile Ser Val Phe Ser Ser Gly Tyr Ser Phe Tyr Lys Asn Leu Leu
1 5 10 15

Leu Asp Ser Lys Pro Asn Ile Leu Lys Pro Pro Cys Leu Leu Phe Ser
20 25 30

Pro Val Val Ile Met Ser Pro Met Arg Lys Lys Lys Lys His Gly Asp
35 40 45

Pro Cys Ile Cys Ser Val Ala Gly Arg Thr Arg Asn Leu Asp Ile Pro
50 55 60

Gln Ile Glu Glu Glu Glu Glu Asn Val Glu Glu Leu Ile Glu Gln Thr
65 70 75 80

Asp Ser Asp Ile Val His Ile Lys Lys Thr Leu Gly Gly Lys Gln Ser
85 90 95

Lys Arg Pro Thr Gly Ser Ile Val Ala Pro Val Ser Cys Leu Gly Ile
100 105 110

Leu Ser Met Ile Gly Pro Ala Val Tyr Phe Lys Phe Ser Arg Leu Met
115 120 125

Glu Gly Gly Asp Ile Pro Val Ala Glu Met Gly Ile Thr Phe Ala Thr
130 135 140

Phe Val Ala Ala Ala Val Gly Thr Glu Phe Leu Ser Ala Trp Val His
145 150 155 160

Lys Glu Leu Trp His Glu Ser Leu Trp Tyr Ile His Lys Ser His His
165 170 175

Arg Ser Arg Lys Gly Arg Phe Glu Phe Asn Asp Val Phe Ala Ile Ile
180 185 190

Asn Ala Leu Pro Ala Ile Ala Leu Ile Asn Tyr Gly Phe Ser Asn Glu
195 200 205

Gly Leu Leu Pro Gly Ala Cys Phe Gly Val Gly Leu Gly Thr Thr Val
210 215 220

Cys Gly Met Ala Tyr Ile Phe Leu His Asn Gly Leu Ser His Arg Arg
225 230 235 240

Phe Pro Val Trp Leu Ile Ala Asn Val Pro Tyr Phe His Lys Leu Ala
245 250 255

Ala Ala His Gln Ile His His Ser Gly Lys Phe Gln Gly Val Pro Phe
260 265 270

Gly Leu Phe Leu Gly Pro Lys Glu Leu Glu Glu Val Arg Gly Gly Thr
275 280 285

Glu Glu Leu Glu Arg Val Ile Ser Arg Thr Thr Lys Arg Thr Gln Pro
290 295 300

Ser Thr
305
305 


<210> 5 
<211> 1205 
<212> DNA 

<213> Adonis aestivalis 
<400> 5 

gggctgcagg aattcggcac gagagcaatc tcagtgttca gtacaagtta ttctttccac 60 
aagaatctct tgttgcactc aaaacaagac attctcaacc gcccatgttt gctcttctct 120 
ccagttgtgg tggagtcgcc tatgagaaag aaaaagacac atcgtgctgc atgtatctgc 180 
tctgttgcag agagaacaag gaaccttgat attcctcaaa ttgaagaaga ggaagagaac 240 
gaggaagaac taatagaaca gacggattct ggcataattc atataaagaa aacgctaggg 300 
gggaaacaat caagacggtc cactggctcc attgtcgcac ccgtatcttg tcttgggatc 360
ctttcaatga tcggacctgc tgtttacttc aagttttcac ggctaatgga gtgtggagat 420
attcctgtcg cagaaatggg gattacgttt gccgcctttg ttgctgctgc gattggcacg 480
gaatttttgt caggatgggt tcacaaagaa ctctggcacg attctttgtg gtacattcac 540
aagtctcacc ataggtcacg aaaaggccgc ttcgagttca atgatgtgtt tgctattatt 600
aacgcgcttc ctgctattgc tcttatcaat tatggattct caaatgaagg cctccttcct 660
ggagcctgct ttggtaccgg tcttggaacg acagtctgtg gcatggctta catttttctt 720
cacaatggcc tttcacaccg aaggttccca gtagggctta ttgcaaacgt cccttatttc 780
cacaagctgg ctgcagctca ccaaatccat cactcaggaa aatttcaggg tgtaccattt 840
ggcctgttcc ttggacccca ggaattggaa gaagtaagag gaggcactga agaattggag 900
agggtgatca gtcgtacagc taaacgaacg caatcatcta catgaatcaa ctcttttaca 960
tttatgaggt tttagtttat cggtgttaca agtcacacat ttgtgtcgtt gtagtaattc 1020
aaagttacca tactcttttt tagaattttt ttttgatgta taggtcgcgg agttacggtt 1080
acaaaggcca aatctattgt tgtggaattc cattattaaa aataaaaatt agagtttgta 1140
gttttatctg gtgatcaata tcaatatata ttaattaaag caaaaaaaaa aaaaaaaaac 1200
tcgag 1205

<210> 6
<211> 315
<212> PRT

<213> Adonis aestivalis
<400> 6

Met Gly Leu Gln Glu Phe Gly Thr Arg Ala Ile Ser Val Phe Ser Thr
1 5 10 15

Ser Tyr Ser Phe His Lys Asn Leu Leu Leu His Ser Lys Gln Asp Ile
20 25 30

Leu Asn Arg Pro Cys Leu Leu Phe Ser Pro Val Val Val Glu Ser Pro
35 40 45

Met Arg Lys Lys Lys Thr His Arg Ala Ala Cys Ile Cys Ser Val Ala
50 55 60

Glu Arg Thr Arg Asn Leu Asp Ile Pro Gln Ile Glu Glu Glu Glu Glu
65 70 75 80

Asn Glu Glu Glu Leu Ile Glu Gln Thr Asp Ser Gly Ile Ile His Ile
85 90 95

Lys Lys Thr Leu Gly Gly Lys Gln Ser Arg Arg Ser Thr Gly Ser Ile
100 105 110

Val Ala Pro Val Ser Cys Leu Gly Ile Leu Ser Met Ile Gly Pro Ala
115 120 125

Val Tyr Phe Lys Phe Ser Arg Leu Met Glu Cys Gly Asp Ile Pro Val
130 135 140

Ala Glu Met Gly Ile Thr Phe Ala Ala Phe Val Ala Ala Ala Ile Gly
145 150 155 160

Thr Glu Phe Leu Ser Gly Trp Val His Lys Glu Leu Trp His Asp Ser
165 170 175

Leu Trp Tyr Ile His Lys Ser His His Arg Ser Arg Lys Gly Arg Phe
180 185 190

Glu Phe Asn Asp Val Phe Ala Ile Ile Asn Ala Leu Pro Ala Ile Ala
195 200 205

Leu Ile Asn Tyr Gly Phe Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys
210 215 220

Phe Gly Thr Gly Leu Gly Thr Thr Val Cys Gly Met Ala Tyr Ile Phe
225 230 235 240

Leu His Asn Gly Leu Ser His Arg Arg Phe Pro Val Gly Leu Ile Ala
245 250 255

Asn Val Pro Tyr Phe His Lys Leu Ala Ala Ala His Gln Ile His His
260 265 270

Ser Gly Lys Phe Gln Gly Val Pro Phe Gly Leu Phe Leu Gly Pro Gln
275 280 285

Glu Leu Glu Glu Val Arg Gly Gly Thr Glu Glu Leu Glu Arg Val Ile
290 295 300

Ser Arg Thr Ala Lys Arg Thr Gln Ser Ser Thr
305 310 315


PCT/US99/10455

<210> 7
<211> 1141
<212> DNA

<213> Adonis aestivalis

<400> 7

gggctgcagg aattcggcac gagagcaatt tcagtgttca gttcaggtta ttctttctac 60 
aagaatctct tgttggactc aaaaccaaat attctcaaac ccccatgcct gctattctct 120 
ccagttgtga tcatgtcgcc tatgagaaag aaaaagaaac atggtgatcc atgtatctgc 180 
tccgttgcag ggagaacaag gaaccttgat attcctcaaa ttgaagaaga ggaagagaat 240
gtggaagaac taatagaaca gaccgattct gacatagtgc atataaagaa aacactaggg 300 
gggaaacaat caaaacggcc cactggctcc attgtcgcac ccgtatcttg tcttgggatc 360 
ctttcaatga ttggacctgc tgtttacttc aagttttcac ggctaatgga gggtggagat 420 
atacctgtag cagaaatggg gattacgttt gccacctttg ttgctgctgc tgttggcacg 480 
gagtttttgt cagcatgggt tcacaaagaa ctctggcacg agtctttgtg gtacattcac 540 
aagtctcacc atcggtcacg aaaaggccgc ttcgagttca atgatgtgtt tgctattatt 600 
aacgcgcttc ccgctattgc tcttatcaat tatggattct ccaatgaagg cctccttcct 660 
ggagcgtgct ttggtgtcgg tcttggaaca acagtctgtg gtatggctta catttttctt 720 
cacaatggcc tatcacaccg aaggttccca gtatggctta ttgcgaacgt cccttatttc 780 
cacaagctgg ctgcagctca ccaaatacac cactcaggaa aatttcaggg tgtaccattt 840 
ggcctgttcc ttggacccaa ggaattggaa gaagtaagag gaggcactga agagttggag 900 
agggtaatca gtcgtacaac taaacgaacg caaccatcta cctgaatcaa tttttttaca 960 
tatataaggt tttagtttat cggtgttata aaatcacaca tccgtatcgt tttagtaagt 1020 
caaagttaag atacttcctt cttagaatat tttttgatgt ataggtcgcg gatatactgt 1080
tacactattc gttgtggaat tccattataa aaaaataaaa aaaaaaaaaa aaaaactcga 1140
g 1141

<210> 8 
<211> 315 
<212> PRT 

<213> Adonis aestivalis 
<400> 8 

Met Gly Leu Gln Glu Phe Gly Thr Arg Ala Ile Ser Val Phe Ser Ser
1 5 10 15

Gly Tyr Ser Phe Tyr Lys Asn Leu Leu Leu Asp Ser Lys Pro Asn Ile
20 25 30

Leu Lys Pro Pro Cys Leu Leu Phe Ser Pro Val Val Ile Met Ser Pro
35 40 45

Met Arg Lys Lys Lys Lys His Gly Asp Pro Cys Ile Cys Ser Val Ala
50 55 60

Gly Arg Thr Arg Asn Leu Asp Ile Pro Gln Ile Glu Glu Glu Glu Glu
65 70 75 80

Asn Val Glu Glu Leu Ile Glu Gln Thr Asp Ser Asp Ile Val His Ile
85 90 95

Lys Lys Thr Leu Gly Gly Lys Gln Ser Lys Arg Pro Thr Gly Ser Ile
100 105 110

Val Ala Pro Val Ser Cys Leu Gly Ile Leu Ser Met Ile Gly Pro Ala
115 120 125

Val Tyr Phe Lys Phe Ser Arg Leu Met Glu Gly Gly Asp Ile Pro Val
130 135 140

Ala Glu Met Gly Ile Thr Phe Ala Thr Phe Val Ala Ala Ala Val Gly
145 150 155 160

Thr Glu Phe Leu Ser Ala Trp Val His Lys Glu Leu Trp His Glu Ser
165 170 175

Leu Trp Tyr Ile His Lys Ser His His Arg Ser Arg Lys Gly Arg Phe
180 185 190

Glu Phe Asn Asp Val Phe Ala Ile Ile Asn Ala Leu Pro Ala Ile Ala
195 200 205

Leu Ile Asn Tyr Gly Phe Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys
210 215 220

Phe Gly Val Gly Leu Gly Thr Thr Val Cys Gly Met Ala Tyr Ile Phe
225 230 235 240

Leu His Asn Gly Leu Ser His Arg Arg Phe Pro Val Trp Leu Ile Ala
245 250 255

Asn Val Pro Tyr Phe His Lys Leu Ala Ala Ala His Gln Ile His His
260 265 270

Ser Gly Lys Phe Gln Gly Val Pro Phe Gly Leu Phe Leu Gly Pro Lys
275 280 285

Glu Leu Glu Glu Val Arg Gly Gly Thr Glu Glu Leu Glu Arg Val Ile
290 295 300

Ser Arg Thr Thr Lys Arg Thr Gln Pro Ser Thr
305 310 315


<210> 9 
<211> 1149 
<212> DNA 

<213> Adonis aestivalis 
<400> 9 

agcaatctca gtgttcac-a caagttattc tttccacaag aatctcttgt tgcactcaaa 60 
acaagacatt ctcaaccccz catgtttgct cttctctcca gttgtggtgg agtcgcctat 120 
gagaaagaaa aagacaca-c gtgctgcatg tatctgctct gttgcagaga gaacaaggaa 180 
ccttgatatt cctcaaatzg aagaagagga agagaacgag gaagaactaa tagaacagac 240 
ggattctggc ataattcata taaagaaaac gctagggggg aaacaatcaa gacggtccac 300 
tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgatcg gacctgctgt 360 
ttacttcaag ttttcacggc taatggagtg tggagatatt cctgtcgcag aaatggggat 420 
tacgtttgcc gcctttgtcg ctgctgcgat tggcacggaa tttttgtcag gatgggttca 480 
caaagaactc tggcacgaT:- ctttgtggta cattcacaag tctcaccata ggtcacgaaa 540 
aggccgcttc gagttcaang atgtgtttgc tattattaac gcgcttcctg ctattgctct 600 
tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg gtaccggtct 660 
tggaacgaca gtctgtggca tggcttacat ttttcttcac aatggccttt cacaccgaag 720 
gttcccagta gggcttatrg caaacgtccc ttatttccac aagctggctg cagctcacca 780 
aatccatcac tcaggaaaai: ttcagggtgt accatttggc ctgttccttg gaccccagga 840 
attggaagaa gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa 900 
acgaacgcaa tcatctacat gaatcaactc ttttacattt atgaggtttt agtttatcgg 960 
tgttacaagt cacacattcg tgtcgttgta gtaattcaaa gttaccatac tcttttttag 1020
aatttttttt tgatgtatag gtcgcggagt tacggttaca aaggccaaat ctattgttgt 1080 
ggaattccat tattaaaaat aaaaattaga gtttgtagtt ttatctggtg atcaatatca 1140 
atatatatt 1149

<210> 10 

<211> 310 

<212> PRT 

<213> Arabidopsis 

<400> 10 

Met Ala Ala Xaa Leu Ser Thr Ala Val Thr Phe Lys Pro Leu His Arg
1 5 10 15

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser
20 25 30

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys
35 40 45

Tyr Val Val Glu Glu Arg Arg Gln Asn Ser Pro Ile Glu Asn Asp Glu
50 55 60

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala Ile Asp Ala Glu Tyr Leu
65 70 75 80

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser
85 90 95

Thr Tyr Leu Ile Ala Ala Met Leu Ser Ser Phe Gly Ile Thr Ser Met
100 105 110

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gln Met Glu Gly Gly
115 120 125

Glu Ile Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly
130 135 140

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu
145 150 155 160

Trp His Ala Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg
165 170 175

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala Ile Val Asn Ala Gly
180 185 190

Pro Ala Ile Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val
195 200 205

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly Ile Thr Val Phe Gly Ile
210 215 220

Ala Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val
225 230 235 240

Gly Pro Ile Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His
245 250 255

Gln Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe
260 265 270

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp
275 280 285

Lys Glu Ile Ser Arg Arg Ile Lys Ser Tyr Lys Lys Ala Ser Gly Ser
290 295 300

Gly Ser Ser Ser Ser Ser
305 310


<210> 11 
<211> 60 
<212> DNA 

<213> Adonis aestivalis 
<400> 11 

cataccataa atagtagagg acaacctaca aaccaaccac cagaaacctc caatggcagc 60 

<210> 12 
<211> 309 
<212> PRT 

<213> Adonis aestivalis 
<400> 12 

Met Ala Ala Ala Ile Ser Val Phe Ser Ser Gly Tyr Ser Phe Tyr Lys
1 5 10 15

Asn Leu Leu Leu Asp Ser Lys Pro Asn Ile Leu Lys Pro Pro Cys Leu
20 25 30

Leu Phe Ser Pro Val Val Ile Met Ser Pro Met Arg Lys Lys Lys Lys
35 40 45

His Gly Asp Pro Cys Ile Cys Ser Val Ala Gly Arg Thr Arg Asn Leu
50 55 60

Asp Ile Pro Gln Ile Glu Glu Glu Glu Glu Asn Val Glu Glu Leu Ile
65 70 75 80

Glu Gln Thr Asp Ser Asp Ile Val His Ile Lys Lys Thr Leu Gly Gly
85 90 95

Lys Gln Ser Lys Arg Pro Thr Gly Ser Ile Val Ala Pro Val Ser Cys
100 105 110

Leu Gly Ile Leu Ser Met Ile Gly Pro Ala Val Tyr Phe Lys Phe Ser
115 120 125

Arg Leu Met Glu Gly Gly Asp Ile Pro Val Ala Glu Met Gly Ile Thr
130 135 140

Phe Ala Thr Phe Val Ala Ala Ala Val Gly Thr Glu Phe Leu Ser Ala
145 150 155 160

Trp Val His Lys Glu Leu Trp His Glu Ser Leu Trp Tyr Ile His Lys
165 170 175

Ser His His Arg Ser Arg Lys Gly Arg Phe Glu Phe Asn Asp Val Phe
180 185 190

Ala Ile Ile Asn Ala Leu Pro Ala Ile Ala Leu Ile Asn Tyr Gly Phe
195 200 205

Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys Phe Gly Val Gly Leu Gly
210 215 220

Thr Thr Val Cys Gly Met Ala Tyr Ile Phe Leu His Asn Gly Leu Ser
225 230 235 240

His Arg Arg Phe Pro Val Trp Leu Ile Ala Asn Val Pro Tyr Phe His
245 250 255

Lys Leu Ala Ala Ala His Gln Ile His His Ser Gly Lys Phe Gln Gly
260 265 270

Val Pro Phe Gly Leu Phe Leu Gly Pro Lys Glu Leu Glu Glu Val Arg
275 280 285

Gly Gly Thr Glu Glu Leu Glu Arg Val Ile Ser Arg Thr Thr Lys Arg
290 295 300

Thr Gln Pro Ser Thr
305


<210> 13 

<211> 310 

<212> PRT 

<213> Arabidopsis 

<400> 13 

Met Ala Ala Xaa Leu Ser Thr Ala Val Thr Phe Lys Pro Leu His Arg
1 5 10 15

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser
20 25 30

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys
35 40 45

Tyr Val Val Glu Glu Arg Arg Gln Asn Ser Pro Ile Glu Asn Asp Glu
50 55 60

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala Ile Asp Ala Glu Tyr Leu
65 70 75 80

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser
85 90 95

Thr Tyr Leu Ile Ala Ala Met Leu Ser Ser Phe Gly Ile Thr Ser Met
100 105 110

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gln Met Glu Gly Gly
115 120 125

Glu Ile Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly
130 135 140

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu
145 150 155 160

Trp His Ala Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg
165 170 175

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala Ile Val Asn Ala Gly
180 185 190

Pro Ala Ile Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val
195 200 205

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly Ile Thr Val Phe Gly Ile
210 215 220

Ala Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val
225 230 235 240

Gly Pro Ile Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His
245 250 255

Gln Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe
260 265 270

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp
275 280 285

Lys Glu Ile Ser Arg Arg Ile Lys Ser Tyr Lys Lys Ala Ser Gly Ser
290 295 300

Gly Ser Ser Ser Ser Ser
305 310


<210> 14 

<211> 305 

<212> PRT 

<213> Arabidopsis 

<400> 14 

Met Ala Ala Gly Leu Ser Thr Ile Ala Val Thr Leu Lys Pro Leu Asn
1 5 10 15

Arg Ser Ser Phe Ser Ala Asn His Pro Ile Ser Thr Ala Val Phe Pro
20 25 30

Pro Ser Leu Arg Phe Asn Gly Phe Arg Arg Arg Lys Ile Leu Thr Val
35 40 45

Cys Phe Val Val Glu Glu Arg Lys Gln Ser Ser Pro Met Asp Asp Asp
50 55 60

Asn Lys Pro Glu Ser Thr Thr Ser Ser Ser Glu Ile Leu Met Thr Ser
65 70 75 80

Arg Leu Leu Lys Lys Ala Glu Lys Lys Lys Ser Glu Arg Phe Thr Tyr
85 90 95

Leu Ile Ala Ala Val Met Ser Ser Phe Gly Ile Thr Ser Met Ala Ile
100 105 110

Met Ala Val Tyr Tyr Arg Phe Ser Trp Gln Met Lys Gly Gly Glu Val
115 120 125

Ser Val Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala Ala
130 135 140

Val Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu Trp
145 150 155 160

His Asp Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg Glu
165 170 175

Gly Ala Phe Glu Leu Asn Asp Val Phe Ala Ile Thr Asn Ala Val Pro
180 185 190

Ala Ile Gly Leu Leu Tyr Tyr Gly Phe Leu Asn Lys Gly Leu Val Pro
195 200 205

Gly Leu Cys Phe Gly Ala Gly Leu Gly Ile Thr Met Phe Gly Met Ala
210 215 220

Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly
225 230 235 240

Pro Ile Ala Asn Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His Gln
245 250 255

Leu His His Thr Asp Lys Phe Lys Gly Val Pro Tyr Gly Leu Phe Leu
260 265 270


Gly Pro Lys Gln Glu Val Glu Glu Val Gly Gly Lys Glu Glu Leu Glu
275 280 285

Lys Glu Ile Ser Arg Arg Ile Lys Leu Tyr Asn Lys Gly Ser Ser Thr
290 295 300

Ser
305


<210> 15 
<211> 315 
<212> PRT 

<213> Capsicum annuum 


<400> 15 

Met Ala Ala Glu Ile Ser Ile Ser Ala Ser Ser Arg Ala Ile Cys Leu
1 5 10 15

Gln Arg Asn Pro Phe Pro Ala Pro Lys Tyr Phe Ala Thr Ala Pro Pro
20 25 30

Leu Leu Phe Phe Ser Pro Leu Thr Cys Asn Leu Asp Ala Ile Leu Arg
35 40 45

Ser Arg Arg Lys Pro Arg Leu Ala Ala Cys Phe Val Leu Lys Asp Asp
50 55 60

Lys Leu Tyr Thr Ala Gln Ser Gly Lys Gln Ser Asp Thr Glu Ala Ile
65 70 75 80

Gly Asp Glu Ile Glu Val Glu Thr Asn Glu Glu Lys Ser Leu Ala Val
85 90 95

Arg Leu Ala Glu Lys Phe Ala Arg Lys Lys Ser Glu Arg Phe Thr Tyr
100 105 110

Leu Val Ala Ala Val Met Ser Ser Leu Gly Ile Thr Ser Met Ala Val
115 120 125

Ile Ser Val Tyr Tyr Arg Phe Ser Trp Gln Met Glu Gly Gly Glu Met
130 135 140

Pro Phe Ser Glu Met Phe Cys Thr Phe Ala Leu Ala Phe Gly Ala Ala
145 150 155 160

Ile Gly Met Glu Tyr Trp Ala Arg Trp Ala His Arg Ala Leu Trp His
165 170 175

Ala Ser Leu Trp His Met His Glu Ser His His Arg Pro Arg Glu Gly
180 185 190

Pro Phe Glu Leu Asn Asp Ile Phe Ala Ile Ile Asn Ala Val Pro Ala
195 200 205

Ile Ala Phe Phe Ser Phe Gly Phe Asn His Lys Gly Leu Ile Pro Gly
210 215 220

Ile Cys Phe Gly Ala Gly Leu Gly Ile Thr Val Phe Gly Met Ala Tyr
225 230 235 240

Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly Pro
245 250 255

Ile Ala Lys Val Pro Tyr Phe Gln Arg Val Ala Ala Ala His Gln Leu
260 265 270

His His Ser Asp Lys Phe Asp Gly Val Pro Tyr Gly Leu Phe Leu Gly
275 280 285

Pro Lys Glu Leu Glu Glu Val Gly Val Ile Glu Glu Leu Glu Lys Glu
290 295 300

Val Asn Arg Arg Ile Lys Ser Leu Lys Arg Leu
305 310 315


<210> 16 
<211> 316 
<212> PRT

<213> Capsicum annuum 
<400> 16 

Thr Thr Gly Arg Tyr His Tyr Gln Leu Val Trp Cys Gln Ile Ser Phe
1 5 10 15

Ser Ser Thr Ser Arg Thr Ser Tyr Tyr Arg His Ser Pro Phe Leu Gly
20 25 30

Pro Lys Pro Thr Pro Thr Thr Pro Ser Val Tyr Pro Ile Thr Pro Phe
35 40 45

Ser Pro Asn Leu Gly Ser Ile Leu Arg Cys Arg Arg Arg Pro Ser Phe
50 55 60

Thr Val Cys Phe Val Leu Glu Asp Asp Lys Phe Lys Thr Gln Phe Glu
65 70 75 80

Ala Gly Glu Glu Asp Ile Glu Met Lys Ile Glu Glu Gln Ile Ser Ala
85 90 95

Thr Arg Leu Ala Glu Lys Leu Ala Arg Lys Lys Ser Glu Arg Phe Thr
100 105 110

Tyr Leu Val Ala Ala Val Met Ser Ser Phe Gly Ile Thr Ser Met Ala
115 120 125

Val Met Ala Val Tyr Tyr Arg Phe Tyr Trp Gln Met Glu Gly Gly Glu
130 135 140

Val Pro Phe Ser Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala
145 150 155 160

Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Lys Ala Leu Trp
165 170 175

His Ala Ser Leu Trp His Met His Glu Ser His His Lys Pro Arg Glu
180 185 190

Gly Pro Phe Glu Leu Asn Asp Val Phe Ala Ile Ile Asn Ala Val Pro
195 200 205

Ala Ile Ala Leu Leu Asp Tyr Gly Phe Phe His Lys Gly Leu Ile Pro
210 215 220

Gly Leu Cys Phe Gly Ala Gly Leu Gly Ile Thr Val Phe Gly Met Ala
225 230 235 240

Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly
245 250 255

Pro Val Ala Asn Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His Ser
260 265 270

Leu His His Ser Glu Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe Leu
275 280 285

Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Leu Glu Glu Leu Glu Lys
290 295 300

Glu Val Asn Arg Arg Thr Arg Tyr Ile Lys Gly Ser
305 310 315


<210> 17 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 17 

cagaatcggt ctgttctatt agttcttcc 29 

<210> 18 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 18 

caatttgagg aatatcaagg ttccttgttc tc 32